56 research outputs found

    Demographic Inference and Representative Population Estimates from Multilingual Social Media Data

    Get PDF
    Social media provide access to behavioural data at an unprecedented scale and granularity. However, using these data to understand phenomena in a broader population is difficult due to their non-representativeness and the bias of statistical inference tools towards dominant languages and groups. While demographic attribute inference could be used to mitigate such bias, current techniques are almost entirely monolingual and fail to work in a global environment. We address these challenges by combining multilingual demographic inference with post-stratification to create a more representative population sample. To learn demographic attributes, we create a new multimodal deep neural architecture for joint classification of age, gender, and organization-status of social media users that operates in 32 languages. This method substantially outperforms current state of the art while also reducing algorithmic bias. To correct for sampling biases, we propose fully interpretable multilevel regression methods that estimate inclusion probabilities from inferred joint population counts and ground-truth population counts. In a large experiment over multilingual heterogeneous European regions, we show that our demographic inference and bias correction together allow for more accurate estimates of populations and make a significant step towards representative social sensing in downstream applications with multilingual social media

    The Integrated Carbon Observation System in Europe

    Get PDF
    Since 1750, land-use change and fossil fuel combustion has led to a 46% increase in the atmospheric carbon dioxide (CO2) concentrations, causing global warming with substantial societal consequences. The Paris Agreement aims to limit global temperature increases to well below 2 degrees C above preindustrial levels. Increasing levels of CO2 and other greenhouse gases (GH6s), such as methane (CH4) and nitrous oxide (N2O), in the atmosphere are the primary cause of climate change. Approximately half of the carbon emissions to the atmosphere are sequestered by ocean and land sinks, leading to ocean acidification but also slowing the rate of global warming. However, there are significant uncertainties in the future global warming scenarios due to uncertainties in the size, nature, and stability of these sinks. Quantifying and monitoring the size and timing of natural sinks and the impact of climate change on ecosystems are important information to guide policy-makers' decisions and strategies on reductions in emissions. Continuous, long-term observations are required to quantify GHG emissions, sinks, and their impacts on Earth systems. The Integrated Carbon Observation System (ICOS) was designed as the European in situ observation and information system to support science and society in their efforts to mitigate climate change. It provides standardized and open data currently from over 140 measurement stations across 12 European countries. The stations observe GHG concentrations in the atmosphere and carbon and GHG fluxes between the atmosphere, land surface, and the oceans. This article describes how ICOS fulfills its mission to harmonize these observations, ensure the related long-term financial commitments, provide easy access to well-documented and reproducible high-quality data and related protocols and tools for scientific studies, and deliver information and GHG-related products to stakeholders in society and policy.Peer reviewe

    CT Characteristics of Pheochromocytoma: Relevance for the Evaluation of Adrenal Incidentaloma.

    Get PDF
    BACKGROUND: Up to 7% of all adrenal incidentalomas (AIs) are pheochromocytomas (PCCs). In the evaluation of AI, it is generally recommended that PCC be excluded by measurement of plasma-free or 24-hour urinary fractionated metanephrines. However, recent studies suggest that biochemical exclusion of PCC not be performed for lesions with CT characteristics of an adrenocortical adenoma (ACA). AIM: To determine the proportion of PCCs with ACA-like attenuation or contrast washout on CT. METHODS: For this multicenter retrospective study, two central investigators independently analyzed the CT reports of 533 patients with 548 histologically confirmed PCCs. Data on tumor size, unenhanced Hounsfield units (HU), absolute percentage washout (APW), and relative percentage washout (RPW) were collected in addition to clinical parameters. RESULTS: Among the 376 PCCs for which unenhanced attenuation data were available, 374 had an attenuation of >10 HU (99.5%). In the two exceptions (0.5%), unenhanced attenuation was exactly 10 HU, which lies just within the range of ≤10 HU that would suggest a diagnosis of ACA. Of 76 PCCs with unenhanced HU > 10 and available washout data, 22 (28.9%) had a high APW and/or RPW, suggestive of ACA. CONCLUSION: Based on the lack of PCCs with an unenhanced attenuation of <10 HU and the low proportion (0.5%) of PCCs with an attenuation of 10 HU, it seems reasonable to abstain from biochemical testing for PCC in AIs with an unenhanced attenuation of ≤10 HU. The assessment of contrast washout, however, is unreliable for ruling out PCC

    Multi-ancestry genome-wide association study accounting for gene-psychosocial factor interactions identifies novel loci for blood pressure traits

    Get PDF
    Psychological and social factors are known to influence blood pressure (BP) and risk of hypertension and associated cardiovascular diseases. To identify novel BP loci, we carried out genome-wide association meta-analyses of systolic, diastolic, pulse, and mean arterial BP, taking into account the interaction effects of genetic variants with three psychosocial factors: depressive symptoms, anxiety symptoms, and social support. Analyses were performed using a two-stage design in a sample of up to 128,894 adults from five ancestry groups. In the combined meta-analyses of stages 1 and 2, we identified 59 loci (p value &lt; 5e−8), including nine novel BP loci. The novel associations were observed mostly with pulse pressure, with fewer observed with mean arterial pressure. Five novel loci were identified in African ancestry, and all but one showed patterns of interaction with at least one psychosocial factor. Functional annotation of the novel&nbsp;loci supports a major role for genes implicated in the immune response (PLCL2), synaptic function and neurotransmission (LIN7A and PFIA2), as well as genes previously implicated in neuropsychiatric or stress-related disorders (FSTL5 and CHODL). These findings underscore the importance of considering psychological and social factors in gene discovery for BP, especially in non-European populations

    The Early Growth Genetics (EGG) and EArly Genetics and Lifecourse Epidemiology (EAGLE) consortia : design, results and future prospects

    Get PDF
    The impact of many unfavorable childhood traits or diseases, such as low birth weight and mental disorders, is not limited to childhood and adolescence, as they are also associated with poor outcomes in adulthood, such as cardiovascular disease. Insight into the genetic etiology of childhood and adolescent traits and disorders may therefore provide new perspectives, not only on how to improve wellbeing during childhood, but also how to prevent later adverse outcomes. To achieve the sample sizes required for genetic research, the Early Growth Genetics (EGG) and EArly Genetics and Lifecourse Epidemiology (EAGLE) consortia were established. The majority of the participating cohorts are longitudinal population-based samples, but other cohorts with data on early childhood phenotypes are also involved. Cohorts often have a broad focus and collect(ed) data on various somatic and psychiatric traits as well as environmental factors. Genetic variants have been successfully identified for multiple traits, for example, birth weight, atopic dermatitis, childhood BMI, allergic sensitization, and pubertal growth. Furthermore, the results have shown that genetic factors also partly underlie the association with adult traits. As sample sizes are still increasing, it is expected that future analyses will identify additional variants. This, in combination with the development of innovative statistical methods, will provide detailed insight on the mechanisms underlying the transition from childhood to adult disorders. Both consortia welcome new collaborations. Policies and contact details are available from the corresponding authors of this manuscript and/or the consortium websites.Peer reviewe

    The Early Growth Genetics (EGG) and EArly Genetics and Lifecourse Epidemiology (EAGLE) consortia:design, results and future prospects

    Get PDF

    Genome-wide meta-analysis of 241,258 adults accounting for smoking behaviour identifies novel loci for obesity traits

    Get PDF
    Few genome-wide association studies (GWAS) account for environmental exposures, like smoking, potentially impacting the overall trait variance when investigating the genetic contribution to obesity-related traits. Here, we use GWAS data from 51,080 current smokers and 190,178 nonsmokers (87% European descent) to identify loci influencing BMI and central adiposity, measured as waist circumference and waist-to-hip ratio both adjusted for BMI. We identify 23 novel genetic loci, and 9 loci with convincing evidence of gene-smoking interaction (GxSMK) on obesity-related traits. We show consistent direction of effect for all identified loci and significance for 18 novel and for 5 interaction loci in an independent study sample. These loci highlight novel biological functions, including response to oxidative stress, addictive behaviour, and regulatory functions emphasizing the importance of accounting for environment in genetic analyses. Our results suggest that tobacco smoking may alter the genetic susceptibility to overall adiposity and body fat distribution.Peer reviewe

    A principal component meta-analysis on multiple anthropometric traits identifies novel loci for body shape

    Get PDF
    Large consortia have revealed hundreds of genetic loci associated with anthropometric traits, one trait at a time. We examined whether genetic variants affect body shape as a composite phenotype that is represented by a combination of anthropometric traits. We developed an approach that calculates averaged PCs (AvPCs) representing body shape derived from six anthropometric traits (body mass index, height, weight, waist and hip circumference, waist-to-hip ratio). The first four AvPCs explain >99% of the variability, are heritable, and associate with cardiometabolic outcomes. We performed genome-wide association analyses for each body shape composite phenotype across 65 studies and meta-analysed summary statistics. We identify six novel loci: LEMD2 and CD47 for AvPC1, RPS6KA5/C14orf159 and GANAB for AvPC3, and ARL15 and ANP32 for AvPC4. Our findings highlight the value of using multiple traits to define complex phenotypes for discovery, which are not captured by single-trait analyses, and may shed light onto new pathways

    The trans-ancestral genomic architecture of glycemic traits

    Get PDF
    Glycemic traits are used to diagnose and monitor type 2 diabetes and cardiometabolic health. To date, most genetic studies of glycemic traits have focused on individuals of European ancestry. Here we aggregated genome-wide association studies comprising up to 281,416 individuals without diabetes (30% non-European ancestry) for whom fasting glucose, 2-h glucose after an oral glucose challenge, glycated hemoglobin and fasting insulin data were available. Trans-ancestry and single-ancestry meta-analyses identified 242 loci (99 novel; P < 5 x 10(-8)), 80% of which had no significant evidence of between-ancestry heterogeneity. Analyses restricted to individuals of European ancestry with equivalent sample size would have led to 24 fewer new loci. Compared with single-ancestry analyses, equivalent-sized trans-ancestry fine-mapping reduced the number of estimated variants in 99% credible sets by a median of 37.5%. Genomic-feature, gene-expression and gene-set analyses revealed distinct biological signatures for each trait, highlighting different underlying biological pathways. Our results increase our understanding of diabetes pathophysiology by using trans-ancestry studies for improved power and resolution. A trans-ancestry meta-analysis of GWAS of glycemic traits in up to 281,416 individuals identifies 99 novel loci, of which one quarter was found due to the multi-ancestry approach, which also improves fine-mapping of credible variant sets.Peer reviewe

    New genetic loci link adipose and insulin biology to body fat distribution.

    Get PDF
    Body fat distribution is a heritable trait and a well-established predictor of adverse metabolic outcomes, independent of overall adiposity. To increase our understanding of the genetic basis of body fat distribution and its molecular links to cardiometabolic traits, here we conduct genome-wide association meta-analyses of traits related to waist and hip circumferences in up to 224,459 individuals. We identify 49 loci (33 new) associated with waist-to-hip ratio adjusted for body mass index (BMI), and an additional 19 loci newly associated with related waist and hip circumference measures (P < 5 × 10(-8)). In total, 20 of the 49 waist-to-hip ratio adjusted for BMI loci show significant sexual dimorphism, 19 of which display a stronger effect in women. The identified loci were enriched for genes expressed in adipose tissue and for putative regulatory elements in adipocytes. Pathway analyses implicated adipogenesis, angiogenesis, transcriptional regulation and insulin resistance as processes affecting fat distribution, providing insight into potential pathophysiological mechanisms
    corecore